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BACKGROUND OF THE INVENTION 

5 

1 . Field of the Invention 

This invention relates to methods and apparatus for processing modem signals in 
networks and more particularly to methods and apparatus for processing modem 
signals in networks having voice compression. 

10 

2. Description of Related Art 

Modems are commonly used to transmit digital data signals d(n) over digital or analog 
voice networks. A first modem connected to a pomt of entry of a voice network 
generates an analog voice signal s(t) containing the digital data signal d(n) by 

15 modulating the digital data signal d(n) using known modulation techniques such as 
V.22bis and V.32bis modulation techniques. A second modem connected to a point of 
exit of the voice network converts or demodulates the transmitted analog voice signal 
s'(t) into a digital data signal d'(n) representing the original digital data signal d(n) 
subject to data loss. In digital voice networks, one source of data loss may be packet 

20 loss, i.e., one or more digital packets representing the voice signal s(t) may be lost 
during transmission from the first modem to the second modem across the digital 
voice network. 



Another source of data loss may be due to digital voice compression of the voice 
signal s(t) as it is converted to a digital voice signal s(n) and transmitted over the 
digital voice network. A normal voice signal s(t), when digitized s(n), uses a 
bandwidth of 64kbps for transmission across a digital network. This bandwidth 
enables the transmission of modem signals as well as voice signals with very low 
distortion. As noted, however, may digital voice networks compress voice signals s(t) 
prior to transmission as digital packets containing a digital signal s(n) representing the 
voice signal s(t). FIGURE 1 is a block diagram of a prior art transmission system for 
transmitting voice signals s(t). 

As shown m FIGURE 1, the system includes a digital voice encoder 2, digital voice 
network 30, and digital voice decoder 4. The digital voice encoder 2 converts a 
received voice (or modem signal) s(t) into a plurality of digital packets representing a 
digital version s(n) of the voice signal. The digital voice network 30 transports the 
plurality of digital packets to the voice decoder 4. As noted above, the digital voice 
network 30 may be a lossy system, i.e., one or more of the plurality of digital packets 
may not be delivered or delivered out of order. Voice decoder 4 receives the digital 
packets representing a digital voice signal s'(n) from the digital voice network 30 and 
generates a voice/modem signal s'(t) from the received digital packets. 

As shown in FIGURE 1, the voice encoder may include a voice compression unit 16. 
The voice compression unit attempts to remove redundant speech information from 
the received speech signal s(t). As noted above, digital speech signals s(n) are 
commonly 64bps signals prior to compression. After compression, the digital speech 
signals s(n) may be 8kbps or less. As noted, the compression unit 16 attempts to 
remove redundant speech information and consequently may distort or corrupt modem 
signals. Some compression units may distort or corrupt the modem signals so severely 
that the modulated signal can not be demodulated without losses. 



To counter for or reduce such error losses, the data rate of a digital signal d(n) 
modulated into a voice signal s(t) to be transmitted by a network having voice 
compression may be reduced. For example, the maximum data rate of a digital signal 
d(n) modulated into a voice signal s(t) that is processed by a compression unit 16 using 

5 the well known, low distortion ADPCM (G.721) 32kpbs compression algorithm is 
4800bps. Thus, even though the compressed digital voice signal s(n) is a 32kpbs 
signal, only a 4800bps digital signal d(n) can be transmitted across a digital voice 
transmission system 5 employing such a compression unit. This is a highly inefficient 
use of bandwidth considering the voice compression unit 16 is added to reduce 

10 bandwidth consumption. Thus, a need exists for a digital voice transmission system 5 
that more efficiently transmits modulated digital signals as well as compressed voice 
signals. 



V 
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SUMMARY OF THE INVENTION 

The present invention is a method and an apparatus of transmitting a modulated data 
signal over a digital voice network that includes voice compression. The method and 

5 apparatus determines whether a voice signal to be transmitted over the network 
includes a modulated data signal and then encodes or demodulates the modulated data 
signal and bypasses voice compression when a modulated data signal is detected. The 
method and apparatus may detect a modulated data signal by detecting the presence of 
an answer tone and then searching for a plurality of phase reversals having a particular 

10 time distance from each other in the voice signal. Once the encoded or demodulated 
signal is transmitted over the digital voice network, the method and apparatus of the 
present invention may detect whether the received signal is a coded data signal or a 
demodulated data signal. Then the encoded data signal may be decoded back into a 
modulated data signal. Likewise, the demodulated data signal may be modulated back 

1 5 into a modulated data signal. 

In one embodiment the invention is a method of determining whether a voice signal 
includes a modulated data signal. In this embodiment, the method first determines 
whether an answer tone is present in the voice signal. When an answer tone is 

20 detected, the method then locates or searches for a first phase reversal in the voice 
signal. Upon the detection of the first phase reversal, the method locates or searches 
for a second phase reversal in the voice signal, the second phase signal being the next 
consecutive phase reversal in the voice signal after the first phase reversal. The time 
interval between the location of the first phase reversal in the signal and the second 

25 phase reversal in the voice signal is then determined and compared to a predetermined 
range of time values. When the time interval is between the predetermined range of 
time values, the method indicated that the voice signal includes a modulated data 
signal. 
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In an exemplary embodiment, the step of determining whether a first or second phase 
reversal is present in the voice signal includes first determining the autocorrelation of a 
segment of the voice signal and searching the autocorrelation of the segment of the 
voice signal for its lowest value. The lowest autocorrelation value is compared to a 
predetermined value and a phase reversal is considered to be present in the segment 
when the lowest autocorrelation value is less than the predetermined value. 
In a further embodiment, the invention locates or searches for a third phase reversal in 
the voice signal, the third phase signal being the next consecutive phase reversal in the 
voice signal after the second phase reversal when the time interval between the first 
and second phase reversal is not between the predetermined time interval. The 
invention mdicates that the voice signal includes a modulated data signal when the 
determined time interval between the location of the second phase reversal in the 
signal and the third phase reversal in the voice signal is between the predetermined 
range of time values. Otherwise, the invention indicates that the voice signal does not 
include a modulated data signal. 

In a fiarther preferred embodiment of the invention the step of determining whether an 
answer tone is present in the voice signal includes selecting a segment of the voice 
signal and converting the segment of the voice signal from a time domain signal to a 
fi-equency domain signal. Then the energy of a bin of the firequency domain signal that 
corresponds to the fi-equency of the answer tone is compared to the energy of 
neighboring or adjacent fi-equency bins. The voice signal is considered to have the 
answer tone when the energy of the frequency bin corresponding to the frequency of 
the answer tone is greater than the energy of the adjacent frequency bins. It is noted 
that the frequency content of the segment of the voice signal may be reduced by down 
sampling prior to conversation to the frequency domain in order to reduce 
computations. 



The present invention also includes a method of packetizing a digital voice signal for 
transmission over a digital voice network where the voice signal may include a 
modulated data signal. The method determines whether the voice signal includes a 
modulated data signal as described above, hi one embodiment when it is determined 

5 that the voice signal includes a modulated data signal, the voice signal is demodulated, 
generatmg a digital data signal, hi another embodiment when it is determined that the 
voice signal includes a modulated data signal, the voice signal is encoded into a digital 
data signal comprising the modulated data signal by linearly quantizing the voice 
signal, hi either case the resultant digital data signal is packetized into a plurality of 

1 0 digital packets for transmission over the digital voice network. 

In one embodiment, when it is determined that the voice signal does not include a 
modulated data signal, the voice signal is compressed packetized into a plurality of 
digital packets for transmission over the digital voice network. It is noted that the step 

15 of Imearly quantizing the voice signal to generate the digital data signal may include 
normalizing the voice signal and convertmg the normalized voice signal into a digital 
floating point signal. Further the step of normalizing the voice signal may include 
finding the sample of the voice signal having the maximum value and normalizing the 
sample of the voice signal having the maximum value by left shifting the sample until 

20 all redundant sign bits are eliminated where NLS is the number of left shifts. Then all 
the samples of the voice signal may be right shifted as a ftmction of NLS. In an 
exemplary embodiment, all the samples of the voice signal may be right shifted as a 
ftmction of NLS and the bit precision of the mantissa of the floating point signal. 
Further, all the samples of the voice signal may be right shifted by 16 - NLS - Njprec 

25 where N _prec is the bit precision of the mantissa of the floating point signal. In this 
embodmient, N_prec may be set to 4 for 24.8 bits per second (BPS) data signals, 5 for 
30.8 BPS data signals, 6 for 36.8 BPS data signals, and 7 for 42.8 BPS data signals. 
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BRIEF DESCRIPTION OF THE DRAWINGS 



FIGURE 1 is a block diagram of a prior art digital voice transmission system including 
voice compression. 

5 

FIGURE 2 is a block diagram of a first exemplary digital voice transmission system in 
accordance with the present invention. 

FIGURE 3 is a block diagram of a second exemplary digital voice transmission system 
1 0 in accordance with the present invention. 

FIGURE 4 is a block diagram of an exemplary modem signal detector in accordance 
with the present invention where the detector may be employed in the digital voice 
transmission system shown in FIGURE 2 or 3. 

15 

FIGURE 5 is a block diagram of an exemplary modem signal encoder in accordance 
with the present invention where the encoder may be employed in the digital voice 
transmission system shown in FIGURE 3. 

20 FIGURE 6 is a block diagram of an exemplary modem signal decoder in accordance 
with the present invention where the decoder may be employed in the digital voice 
transmission system shown in FIGURE 3. 

25 Like reference numbers and designations in the various drawings indicate like 
elements. 
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DETAILED DESCRIPTION OF THE INVENTION 

Throughout this description, the preferred embodiment and examples shown should be 
considered as exemplars, rather than as limitations on the present invention. 

5 

As noted above, prior art digital voice transmission systems 5 (as shown in FIGURE 
1), do not efficiently use the bandwidth of the digital voice network 30 when 
transmitting modulated digital data signals. FIGURE 2 is a diagram of a first 
exemplary digital voice transmission system 40 that more efficiently transmits 

10 modulated digital data signals. As shown in FIGURE 2, the system 40 includes a 
digital voice signal encoder 10, a digital voice network 30, and a digital voice signal 
decoder 20. The system 40 relays digital data signals d(n) by first detecting the modem 
signal s(t) at the entry point to the voice encoder 10. In particular, the modem signal 
detector 12 analyzes received voice/modem signals s(t) to determine if the signal is a 

15 modulated data signal. An exemplary signal detector 12 is described in detail below 
with reference to FIGURE 4. 

When a modem signal is detected, the demodulator 14 converts or demodulates the 
signal s(t) back to the underlying digital data signal d(n) using techniques known to 

20 those of skill in the art. Otherwise, the voice compression unit 16 compresses the 

signal s(t) by removing redundant data and converts the signal to a digital signal s(n). 
Then, the packet generator 18 converts either the digital data -signal d(n) or the digital 
voice signal s(n) to a plurality of packets as described in more detail below. The packet 
generator 18 may add header information to one or more of the plurality of packets to 

25 indicate whether the packets represent a voice or data signal. The header information 
may also indicate the type of modulation used to origmally form the s(t) signal when 
the packets represent digital data. The plurality of digital packets is transmitted across 
the digital voice network 30 to the digital voice signal decoder 20. 
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The packet degenerator 28 of the decoder 20 first converts the received packets back 
into a digital signal s(n), A modem signal detector 22 determines whether the digital 
signal represents a digital voice signal s(n) or a digital data signal d(n). As described 
below^ in more detail and noted above, a header bit in one or more of the packets may 

5 indicate v^hether the digital signal is a voice or data signal When the modem signal 

detector 22 determines the digital signal represents a digital data signal, the modulator 
24 remodulates the digital data signal back into a modem signal s(t) using techniques 
known to those of skill in the art. As noted above, one or more of the packets may 
have a header that includes information which indicates the type of modem 

10 compression originally present in the signal s(t) prior to demodulation by the 
demodulator 14. Otherwise, when the digital signal represents a compressed voice 
signal s(n), the voice decompression unit 26 decompresses the signal using techniques 
known to those of skill in the art. The resultant signal s'(t) represents the original 
signal s(t) presented to the digital voice transmission system 40 provided no packets 

1 5 are lost and not recovered. 

The relying technique used in the system 40 is most efficient use of bandwidth of the 
system since no bandwidth is wasted as described above when modem signals are 
processed by voice compression units. The system 40, however, is computationally 

20 complex because it adds a modulator 24 and demodulator 14 to the system 40. 

Depending on the modulation technique used to generate the modem signal s(t), the 
additional computational complexity, memory constraints, processor speed or other 
chip level issues may prevent the implementation of the techniques. Consequently, a 
compromise between bandwidth and computational complexity, memory, or processor 

25 speed may be required for different system implementations. 
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FIGURE 3 presents a second exemplary digital voice transmission system 70 in 
accordance with the present invention that employs a modem encoder 54 and decoder 
64 in place of a demodulator 14 and modulator 24. As explained below, the system 70 
uses more bandwidth to transmit a signal than the system 40 but is less 
computationally complex and takes considerably less time to design and implement. 
For example the system 40 requires 9.6kbps of network bandwidth to send a V.32bis 
9600bps signal while a transparent modem encoder of system 70 might use as much as 
30kbps of network bandwidth. Developing a modulator 24 and demodulator 14 for 
V.32bis might take 6 man months, however, whereas developing a modem encoder 54 
and decoder 64 for system 70 would probably take only 2 man weeks. 

As shown in FIGURE 3, the digital voice transmission system 70 mcludes a digital 
voice signal encoder 50, a digital voice network 30, and a digital voice signal decoder 
60. The system 70 codes digital data signals d(n) by first detecting the modem signal 
s(t) at the entry point to the voice encoder 50. As for the system 40, the modem signal 
detector 12 analyzes received voice/modem signals s(t) to determine when the signal is 
a modulated data signal. An exemplary signal detector 12 is described in detail below 
with reference to FIGURE 4. 

When a modem signal is detected, a modem signal encoder 54 codes the signal s(t) 
into a coded modem signal Xnet(n) using a process and apparatus described in detail 
below with reference to FIGURE 5. Otherwise, the voice compression unit 16 
compresses the voice signal s(t) by removing redundant data and converts the signal to 
a digital signal s(n). Then, the packet generator 18 converts either the coded modem 
signal Xnet(n) or the compressed digital voice signal s(n) into a plurality of packets as 
described in more detail below. The packet generator 18 may add header information 
to one or more of the plurality of packets to indicate whether the packets represent a 
voice or data signal. The header information may also indicate the type of modem 
coding used to generate the coded modem signal Xnet(n). The plurality of digital 
packets is transmitted across the digital voice network 30 to the digital voice signal 
decoder 60. 
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The packet degenerator 28 of the decoder 60 first converts the received packets back 
into a digital signal s(n). A modem signal detector 22 determines whether the digital 
signal represents a digital voice signal s(n) or a coded modem signal Xnet(n). As noted 

5 above, a header bit in one or more of the packets may indicate whether the digital 
signal is a voice or data signal. When the modem signal detector 22 determines the 
digital signal represents a coded modem signal, the modem signal decoder 64 converts 
the coded modem signal x„et(n) back into a modem signal s(t) as described in more 
detail below with reference to FIGURE 6. As noted above, one or more of the packets 

10 may have a header that includes information, which indicates the type of coding 
performed by the modem signal encoder 54. Otherwise, when the digital signal 
represents a compressed voice signal s(n), the voice decompression imit 26 
decompresses the signal using techniques known to those of skill in the art. The 
resultant signal s'(t) represents the original signal s(t) presented to the digital voice 

1 5 transmission system 70 provided no packets are lost and not recovered. 

As noted above, the two exemplary systems 40 and 70 each include a modem signal 
detector 12 that determines whether a signal s(t) includes a modem signal. An 
exemplary apparatus and method of determining whether a signal s(t) is a modem 

20 signal is presented with reference to FIGURE 4. As shown in FIGURE 4, the modem 
signal detector 12 includes a tone detector 82, down sampler 84, autocorrelator 86, and 
decision block 88. In brief, the detector 12 identifies whether the signal s(t) includes a 
modem signal by determining whether a phase reversal that occurs every 450 ms in 
2100 Hz answer tones transmitted by many answering side modems (according to ITU 

25 Recommendation V.25) is present in the signal s(t). As described in more detail below, 
the tone detector 82 first determines whether a 2100 Hz answer tone is present in the 
signal s(t). The dovra sampler 84, autocorrelator 86, and decision block 88 are used to 
determine whether a phase reversal occurs every 450 ms. 
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In one embodiment of the modem signal detector 12 the signal s(t) is first digitally 
sampled at a 8kHz rate to generate a digital signal Xin(n). It is noted that the phase 
reversal of the signal s(t) occurs during the first second of the answer tone in modem 
protocols that incorporate the V.25 signal at the beginning of the modem handshake, 

5 such as V.22bis and V.32bis. Consequently, only signals s(t) generated by modems 

that incorporate the V.25 signal may be identified using one exemplary detector in 
accordance with the present invention. A summary of the exemplary operation of the 
modem signal detector 12 follows: in a preferred embodiment, the modem signal 
detector 12 operates on blocks of 120 or 160 linear samples at a time. The tone 

10 detector 82 first rejects signals outside the range of 2100 Hz +/- 15 Hz, i.e., if the 
signal does not have a tone within this range, no tone is detected. Then, the down 
sampler 84 down samples the sampled signal Xi„(n), producing an aliased signal x<i(n) 
at lOOHz. It is noted that phase reversals will still be detectable in the aliased signal 
Xd(n) since the fi:equency of the phase reversal (2.2Hz) is slow compared with the 

15 aliased signal frequency. The autocorrelator 86 then uses a sliding window to 
calculate successive autocorrelations of die aliased signal. The decision block 88 
detects a phase reversal when the value of the autocorrelation falls below a 
predetermined threshold, normalized to the signal power. When the decision block 88 
recognizes or detects two phase reversals occurring 450ms apart, the block 88 declares 

20 that a modem signal is present. 
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As noted above, the modem signal detector 12 is ideally optimized to detect phase 
reversals in a 2100 Hz tone +/- 15 Hz as outlined in the international V.25 
specification. Given that tones outside the 2100 Hz range may produce false 
detections, the tone detector 82 rejects signals that have tones outside the 2100 Hz +/- 
15 Hz range as not having a modem signal. It is noted that the tone detector 82 may 
employ any method that can reliably detect a 2100 Hz tone over a block of 120 or 160 
samples. An exemplary implementation of the tone detector 82 performs a 512-point 
FFT on the input, Xin and then compares the energy in the 2100 Hz bin with the energy 
outside the 2100 Hz bin. When the energy outside the 2100 Hz bin is 3dB or more 
below the energy in the 2100 Hz band, the tone detector 82 indicates that a 2100 Hz 
tone is detected. It is noted that the tone detector 82 may have difficulty detecting a 
2100 Hz tone during phase reversals because the sudden change in phase can produce 
high frequency energy. Consequently, in the preferred embodiment, the tone detector 
82 will still indicate the present of 2100 Hz after initialing detecting a 2100 Hz tone, 
unless a tone falls outside the 2100 Hz range for 3 consecutive blocks. 

It is noted that phase reversals, when present, occur infrequently (at a rate of 2.2Hz) 
when compared to the 8 kHz sampling rate of Xin(n). Consequently, the preferred 
modem signal detector 12 includes a down sampler 84 to reduce the number of 
samples, which, m turn reduces the amount of memory used by the modem signal 
detector 12 and the calculations needed to determine whether a modem signal is 
present. The down sampler 84 may generate aliased fi-equency components but 
because the aliased fi-equency components are due to spectral overlap of the original 
signal fi-equencies, phase information is preserved. Consequently, any phase reversals 
will still be detectable in the down sampled signal Xd(n). In the preferred embodiment 
of the invention, the down sampler 84 down samples the signal Xi„(n) by a factor of 
eight, i.e., from a 8 kHz signal to a 1 kHz signal. This changes the 2100 Hz tone into a 
100 Hz aliased tone. The down sampler 84 ideally achieves the down sampling of the 
signal Xi„(n) by taking every 8* sample from the 8kHz digitized signal x,„(n). The 
resultant aliased signal, Xd(n), has a period of 10 ms or 10 samples at the new sampling 
rate. Consequently, the block size of the signal is reduced to 15 or 20 samples. 
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The autocorrelator 86 then operates on the signal Xd(n) to determine whether phase 
reversals occurs at a 450 ms interval. It is known that the autocorrelation of a signal 
provides a measure of the "similarity" between two sections of the signal whose 
starting points are separated or delayed by a particular time interval or "lag". The 

autocorrelation of a signal x(n) is defined as: R(k) = S x(n)x(n+k) where k represents 

the lag factor (delay) and n is the length of each section being compared. It is known 
for periodic signals with period N, x(n) = x(n+N). Consequently, by setting the lag 
factor equal to the period N, the Autocorrelation reduces to the energy in the periodic 

signal: R(N) = E x(n)x(n+N) = E x(n)x(n) = S x(nf = E 

Therefore, the autocorrelation of a signal x(n), normalized to the energy of the signal, is 
equal to 1 for periodic signals when the autocorrelation 1^ is equal to the period of the 
signal. Likewise, the normalized autocorrelation of a periodic signal that contains a 180- 
degree phase shift is equal to -1 since: 

x(n) = - x(n+N) and R(N) = S x(n)x(n+N) = E -x(n)x(n) = - S x(n)^= -E. 
A phase reversal in the middle of a section will result in a normalized autocorrelation value 
falling in between -1 and +1, depending on where the reversal occurs, hi any case, all 
phase reversals will be evident by a drop in the autocorrelation value. 
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Thus according to a preferred embodiment, the autocorrelation is used to detect the 
phase reversal in the aliased signal, Xd, by setting the lag equal to the period (10 ms = 
10 samples) of the 100 Hz aliased signal. Note: the down sampling factor was chosen 
so that the aliased period is made up of an integer number of samples. This selection 

5 helps reduce possible jitter in the autocorrelation value that could be caused by 
variations in sampling location over a period. In addition, in the preferred embodiment 
the autocorrelation of Xd is calculated over a window of 20 samples consisting of two 
adjacent sections made up of 10 samples each (i.e., n=10). Note: a longer window 
would increase the number of calculations. Further, a longer window would also 

10 produce a more smoothed result, reducing the impact of a phase reversal on the 
autocorrelation value, making the phase reversal more difficuh to detect. A shorter 
window may over-emphasize signal noise in the absence of a phase reversal and could 
lead to false detections. 

15 Thus, in the present invention the autocorrelation value will remain at a relatively high 

constant value across windows that do not contain phase reversals. The calculated value 
across a window in which a phase reversal occurs is dependent on where in the window the 
reversal occurs. For example, if Ihe reversal occurs at Ihe 10* sample, on the border 
between the two adjacent signal sections, then the autocorrelation will be very negative 

20 because the first section is 180 degrees out of phase from the second section. This is the 
ideal situation for detecting a phase reversal. If however, the reversal occurs at the l" 
sample, then tiie autocorrelation won't decrease very much because ahnost all the points of 
the two sections are in phase. Consequently, it is very difficult to detect phase reversals 
that occur near the 1"^ sample. Thus, shifting the wmdow by 5 samples between successive 

25 autocorrelation calculations increases the likelihood of the phase reversal occurring 
towards the middle of the window, and thus producing a large drop in the Autocorrelation 
value when such a phase reversal occurs. 
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The resulting Autocorrelation for the modem signal detector is: Rm(10) = 
Exd(n+5m)xd(n+5m+10) where m equals the calculation number and each successive 

calculation shifts tiie window by 5 samples. Since the Autocorrelation is calculated on a 
block basis and down sampling reduces tiie block size to 15 (20) samples, 10 samples from 
tiie previous block are stored for use in the current block calculation. Thus, in tiie preferred 
embodiment, tiie autocorrelation is calculated 2 (3) times per block, witii tiie minimum 
autocorrelation value being passed on to tiie Decision Block 88 for comparison to a 
predetermined value to determine whetiier a phase reversal has occurred. 

As noted above, the decision block receives the minimum autocorrelation value for each 
determination from tiie autocorrelator 86. Li particular, tiie exemplary decision block 
performs two steps in order to determine whetiier a modem signal is present. First, tiie 
minimum autocorrelation value is compared to a predetermmed threshold to determine if a 
phase reversal has occurred. Then, when a second phase reversal is detected, tiie decision 
block 88 determmes whether time between the occurrence of phase reversals is 450 ms 
plus or minus 25 ms. When botii conditions are satisfied, tiie decision block indicates tiiat a 
modem signal is present according to the present invention. 

In tiie exemplary embodiment, tiie predetermined threshold for tiie decision block 88 is set 
to 0.01. This tiireshold was chosen to accommodate any tone in the range of 2100 Hz +/- 
15 Hz at any signal level. It is noted that tiie tiireshold is scaled to match tiie power level to 
normalized signal levels where left shifting tiie tiireshold by the power's normalized 
exponent produces appropriate scaling. Further, tiie power is tiie sum of tiie squares over a 
fitil block of Xin samples. Since tiie autocorrelation is calculated over fewer values tiian the 
power, however, the resulting normalized autocorrelation falls between -0.125 and 0.125. 
Accordingly, a phase reversal is detected when the autocorrelation is less than tiie adjusted 
tiireshold where tiie exemplary predetermined, tiireshold is 0.01 for an autoconelation 
value normalized as described above. 
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As also noted, the decision block 88 indicates a modem signal is present when a first and 
second detected phase reversal occur within 450 ms from each other plus or minus 25 ms. 
Thus, the decision block 88 detects at least two phase reversals before determining or 
deciding when a modem signal is present according to the present invention. In one 

5 embodiment, a timer is started when the decision block 88 determines a first phase reversal 
has occurred. When the decision block determines fliat a second phase reversal has 
occurred and the value of the timer is equal to 450 ms +/-25 ms, the decision block 88 
indicates that a modem signal is present. Otilierwise, when the second phase reversal occurs 
or is detected outside this time range, the timer is reset and a search is conducted for the 

10 next phase reversal. The decision block decides that no modem signal is present when no 
two consecutive phase reversals meet the time criteria before the end of a 2 1 DO Hz tone. 

As noted above in one embodiment of liie invention, the system 70 includes a modem 
signal encoder 54 and a modem signal decoder 64. An exemplary modem signal encoder 

15 54 and modem signal decoder 64 is described below with reference to FIGURES 5 and 6. 

As can be seen from these FIGURES, the modem decoder 64 performs the inverse function 
of the modem signal encoder 54, which is described first below. As shovra in FIGURE 5, 
the exemplary modem signal encoder 54 includes a downsampling unit 92, a block floating 
point unit 94, and a bit packer 96. In the preferred embodiment, the encoder 54 is designed 

20 to operate with a digital signal sampled at a 8kHz rate using 8 bit u-law or A-law 
quantization followed by a conversion to linear 16-bit samples. In this preferred 
embodiment, the encoder 54 may compress this signal at four different rates, including 
24.8kbps, 30.8kbps, 36.8kbps, or 42.8kbps. Ideally, the compression rate is configurable in 
the encoder and is embedded in the transmitted packet (such as in a header as described 

25 above) so the decoder 64 can adjust its operation to the packet size it receives. Further, in 
this preferred embodiment, the exemplary modem encoder 54 is designed to transmit 
V.22bis and V.32bis modem signals. It is noted.that Hie lower two transmission rates of the 
encoder 54 allow modem connections at or below 9.6kbps while the two higher rates allow 
rates as high as 14.4kbps. 

30 
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Similar to the modem signal detector 10, The modem signal encoder 54 operates on 
10ms frames, which yields 80 samples per frame at a 8kHz sampling rate. As noted, 
the exemplary modem signal encoder 54 includes a downsampling unit 92, a block 
floating point unit 94, and a bit packer 96. In the preferred embodiment, the 
5 downsampling unit 92 bandHmits the signal to 3kHz and downsamples the voice 
signal to 6kHz. The block floating point unit 94 normalizes the 6kHz signal and 
converts it to a block floating point format. The bit packer 96 packs the bits of the 
signal generated by the block floating point unit 94 together into bytes to be packetized 
by the packet generator 1 8 and sent to the network 30. 

10 

In a preferred embodiment of the invention, the downsampling unit 92 uses a 
polyphase lowpass FIR filter with 193 taps shown in Table 1 (listed in Q15 format) to 
bandlimit the input signal, Xin, (generated from s(t)) while downsampling the signal 
from 8kHz to 6kHz. The input buffer of the unit 92 consists of 80 samples of the input 

1 5 signal, Xin and the output buffer consists of 60 samples, which represent the signal Xd. 

In the preferred embodiment of the invention, the input signal Xjn is bandlimited and 
downsampled in this way because the modem signals do not have any substantial 
energy above 3kHz. It is noted that ideally the input signal Xi„ is downsampled by 
creating a buffer that is padded with two zeros in between each sample of Xin. The FIR 

20 filter operates on the resulting zero-padded buffer and then every fourth sample output 
from the FIR filter is retained, thereby creating a 60 sample signal Xd. Preferably, the 
retained samples are multiplied by a gain factor of three to compensate for zero 
padding. 
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As noted above, the block floating point unit 94 normalizes the signal xa and converts 
the signal to a floating point signal. In order to first normalize the signal, the imit 94 
first searches Xd for sample having a maximum value, Xdmax- It is noted that the value 

5 Xdmax is a 16-bit linear fixed-point value. This value is normalized to eliminate all 
redundant sign bits. In the preferred embodunent, the value Xdmax is normalized by left 
shiftmg Xdmax until only one sign bit remains. The number of left shifts is saved as the 
value NLS. In normal block floating point conversions the entire buffer would then be 
left shifted by NLS. In this embodiment, the signal is also quantized. Accordingly, the 

10 unit 94 instead right shifts all samples Xd in the buffer by 16-NLS-n_prec where n_prec 
is the bit precision of the mantissa of the resulting floating point number. In the 
preferred embodiment, the mantissa is either 4,5,6, or 7 bits in length corresponding to 
the rate configuration of 24.8kbps, 30.8kbps, 36.8kbps, or 42.8kbps respectively. The 
shifting of the samples linearly quantizes the samples Xd to njprec bits where the 

1 5 quantization levels are effectively determined by the maximum value of Xd. 

The output of the block floating point unit is a signal Xb, which consists of 60 samples 
where each sample has n_prec significant bits. The signal Xb also includes an 8-bit 
sample consisting of the value of NLS "OR"ed with njpvec and left shifted by 4. 
20 Accordingly, the signal Xb consists of pkt size bits where pkt_size = 60*(njprec)+8. 

The bit packer 96 packs the bits of the signal Xb bytes of 8 bits where the bytes are 
packetized by the packet generator 18. ft is noted that the first 8-bit byte of the 
resulting signal Xnet consists of the 8-bit samples having the value of NLS "OR"ed 
with n_prec and left shifted by 4. 

25 
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As noted above, the decoder 64 performs the inverse process of the encoder 54. An 
exemplary decoder 54 is shown in FIGURE 6. As shown in FIGURE 6, the decoder 64 
includes a bit impacker 97, a block floating point exponential unit 95, and an up- 
sampling unit 93. The bit unpacker 97 reverses the process that performed in the bit- 

5 packing unit 96 in the encoder 54. The number of left shifts, NLS, and the bit 
precision number, njrec are stored in the first 8-bit byte of the coded modem signal 
Xnet- The remaining bytes of the coded modem signal Xnet are unpacked in n__prec bit 
blocks and into separate words in memory that reproduces the 60 word sample, Xb. 
Using NLS and n_prec values, the block floating point exponential unit 95 left shifts 

10 the buffer, Xb, by 16-NLS-nj)rec places and zero fills the LSB during shifting. 
Accordingly, the block floating point exponential unit 95 produces the buffer, Xd. 

Finally, the up-sampling unit 93 uses the same polyphase FIR filter used by the 
downsampling unit 92 to up-sample the Xd buffer to produce the Xin buffer. In 
15 particular, the up-sampling unit 93 creates a zero-padded buffer where three zeros are 
inserted between each sample of Xd. Then, FIR filter operates on the resulting buffer 
and every third output value is saved to generate 80 samples. The value of the 80 
samples is then multiplied by a gain factor of 4 to compensate for the zero padding 
process. 

20 

It is noted that variations of the above techniques are possible. In particular, other 
techniques may be used to detect the present of the answer tone and phase reversals. 
Consequently, the invention is not to be limited by the specific illustrated embodiment, 
but only by the scope of the appended claims. 
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CLAIMS 

What is claimed is: 



1 1. A method of determining whether a voice signal includes a modulated data 

2 signal, the method comprising the steps of: 

3 a) determining whether an answer tone is present in the voice signal; 

4 b) when an answer tone is present, performing the steps of: 

5 i) locating a first phase reversal in the voice signal; 

6 ii) locating a second phase reversal in the voice signal, the second 

7 phase signal being the next consecutive phase reversal in the 

8 voice signal after the first phase reversal; 

9 iii) determining the time interval between the location of the first 

10 phase reversal in the signal and the second phase reversal in the 

1 1 voice signal; and 

12 iv) indicating that the voice signal includes a modulated data signal 

13 when the determined time interval is between a predetermined 

1 4 range of time values. 



1 2. The method according to claim 1 , wherein step b) i) includes the steps of: 

2 a) performing an autocorrelation of a segment of the voice signal; 

3 b) searching for the lowest value of the autocorrelation of the segment of 

4 the voice signal; 

5 c) comparing the determined lowest value of the autocorrelation to a 

6 predetermined value; and 

7 d) indicating a phase reversal has occurred when the determined lowest 

8 value is less than the predetermined value. 
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1 3. The method according to claim 1, wherein step b) further includes the steps of: 

2 a) when the determined time interval is not between the predetermined 

3 range of time values, performing the steps of; 

4 i) locating a third phase reversal in the voice signal, the third 

5 phase signal being the next consecutive phase reversal in the 

6 voice signal after the second phase reversal; 

7 ii) determining the time interval between the location of the second 

8 phase reversal in the signal and the third phase reversal in the 

9 voice signal; 

10 iii) indicating that the voice signal includes a modulated data signal 

11 when the determined time interval between the location of the 

12 second phase reversal in the signal and the third phase reversal 

13 in the voice signal is between the predetermined range of time 

14 values; and 

15 iv) indicating that the voice signal does not include a modulated 

16 data signal when the determined time interval between the 

17 location of the second phase reversal in the signal and the third 

18 phase reversal in the voice signal is not between the 

19 predetermined range of time values. 

1 4. The method according to claim 1, wherein step a) includes the steps of: 

2 a) selecting a segment of the voice signal; 

3 b) converting the segment of the voice signal from a time domain signal to 

4 a frequency domain signal; 

5 c) comparing the energy of a bin of the frequency domain signal that 

6 corresponds to the frequency of the answer tone with adjacent 

7 frequency bins; and 

8 d) indicating that the answer tone is present in the segment of the voice 

9 signal when the energy of the frequency bin corresponding to the 

10 frequency of the answer tone is greater than the energy of the adjacent 

1 1 frequency bins. 
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1 5. The method according to claim 4, wherein step a) includes the steps of: 

2 a) selecting a segment of the voice signal; and 

3 b) down sampling the selected segment of the voice signal. 

1 6. A method of packetizing a digital voice signal for transmission over a digital 

2 voice network where the voice signal may include a modulated data signal, the 

3 method comprising the steps of: 

4 a) determining whether an answer tone is present in the voice signal; 

5 b) when an answer tone is present, performing the steps of: 

6 i) locating a first phase reversal in the voice signal; 

7 ii) locating a second phase reversal in the voice signal, the second 

8 phase signal being the next consecutive phase reversal in the 

9 voice signal after the first phase reversal; 

10 iii) determining the time interval between the location of the first 

1 1 phase reversal in the signal and the second phase reversal in the 

12 voice signal; and 

13 iv) when the determined time interval is between a predetermined 

14 range of time values performing the steps of: 

15 a) demodulating the voice signal to generate a digital data 

16 signal; 

17 b) packetizing the digital data signal into a plurality of 

18 digital packets for transmission over the digital voice 

19 network. 
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1 7. The method according to claim 6, wherein step b) i) includes the steps of: 

2 a) performing an autocorrelation of a segment of the voice signal; 

3 b) searching for the lowest value of the autocorrelation of the segment of 

4 the voice signal; 

5 c) comparing the determined lowest value of the autocorrelation to a 

6 predetermined value; and 

7 d) indicating a phase reversal has occurred when the determined lowest 

8 value is less than the predetermined value. 

1 8. The method according to claim 6, wherein step b) further includes the steps of: 

2 a) when the determined time interval is not between the predetermined 

3 range of time values, performing the steps of; 

4 i) locating a third phase reversal in the voice signal, the third 

5 phase signal being the next consecutive phase reversal in the 

6 voice signal after the second phase reversal; 

7 ii) determining the time interval between the location of the second 

8 phase reversal in the signal and the third phase reversal in the 

9 voice signal; 

10 iii) when the determined time interval is between the predetermined 

1 1 range of time values performing the steps of: 

12 a) demodulating the voice signal to generate a digital data 

13 signal; 

14 b) packetizing the digital data signal into a plurality of 

15 digital packets for transmission over the digital voice 

16 network; and 

17 iv) indicating that the voice signal does not include a modulated 

18 data signal when the determined time interval between the 

19 location of the second phase reversal in the signal and the third 

20 phase reversal in the voice signal is not between the 

2 1 predetermined range of time values. 
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1 9. The method according to claim 6, wherein step b) further includes the steps of: 

2 a) when the determined time interval is not between the predetermined 

3 range of time values, performmg the steps of; 

4 i) locating a third phase reversal in the voice signal, the third 

5 phase signal being the next consecutive phase reversal in the 

6 voice signal after the second phase reversal; 

7 ii) determining the time interval between the location of the second 

8 phase reversal in the signal and the third phase reversal in the 

9 voice signal; 

10 iii) when the determined time interval between the location of the 

1 1 second phase reversal in the signal and the third phase reversal 

12 is between the predetermined range of time values performing 

13 the steps of: 

14 a) demodulating the voice signal to generate a digital data 

15 signal; 

16 b) packetizing the digital data signal into a plurality of 

17 digital packets for transmission over the digital voice 

18 network; and 

19 iv) when the determined time interval between the location of the 

20 second phase reversal in the signal and the third phase reversal 

21 is not between the predetermined range of time values 

22 performing the steps of: 

23 a) compressing the voice signal; and 

24 b) packetizing the compressed voice signal into a plurality 

25 of digital packets for transmission over the digital voice 

26 network. 
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The method according to claim 6, wherein step a) includes the steps of: 

a) selecting a segment of the voice signal; 

b) converting the segment of the voice signal from a time domain signal to 
a frequency domain signal; 

c) comparing the energy of a bin of the frequency domain signal that 
corresponds to the frequency of the answer tone with adjacent 
frequency bins; and 

d) indicating that the answer tone is present in the segment of the voice 
signal when the energy of the frequency bin corresponding to the 
frequency of the answer tone is greater than the energy of the adjacent 
frequency bins. 

The method according to claim 10, wherein step a) includes the steps of: 

a) selecting a segment of the voice signal; and 

b) down sampUng the selected segment of the voice signal. 
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1 12. A method of packetizing a digital voice signal for transmission over a digital 

2 voice network where the voice signal may include a modulated data signal, the 

3 method comprising the steps of: 

4 a) determining whether an answer tone is present in the voice signal; and 

5 b) when an answer tone is present, performing the steps of: 

6 i) locating a first phase reversal in the voice signal; 

7 ii) locating a second phase reversal in the voice signal, the second 

8 phase signal being the next consecutive phase reversal in the 

9 voice signal after the first phase reversal; 

10 iii) determining the time interval between the location of the first 

1 1 phase reversal in the signal and the second phase reversal in the 

12 voice signal; and 

13 iv) when the determined time interval is between a predetermined 

14 range of time values performing the steps of: 

15 a) encoding the voice signal into a digital data signal 

16 comprising the modulated data signal by linearly 

17 quantizing the voice signal; and 

18 b) packetizing the coded voice signal into a plurality of 

19 digital packets for transmission over the digital voice 

20 network. 



1 13. The method according to claim 1 2, wherein step b) i) includes the steps of: 

2 a) performing an autocorrelation of a segment of the voice signal; 

3 b) searching for the lowest value of the autocorrelation of the segment of 

4 the voice signal; 

5 c) comparing the determined lowest value of the autocorrelation to a 

6 predetermined value; and 

7 d) indicating a phase reversal has occurred when the determined lowest 

8 value is less than the predetermined value. 
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1 14. The method according to claim 12, wherein step b) further includes the steps 

2 of: 

3 a) when the determined time interval is not between the predetermined 

4 range of time values, performing the steps of; 

5 i) locating a third phase reversal in the voice signal, the third 

6 phase signal being the next consecutive phase reversal in the 

7 voice signal after the second phase reversal; 

8 ii) determining the time interval between the location of the second 

9 phase reversal in the signal and the third phase reversal in the 

10 voice signal; 

1 1 iii) when the determined time interval is between the predetermined 

12 range of time values performing the steps of: 

13 a) encoding the voice signal into a digital data signal 

14 comprising the modulated data signal by linearly 

1 5 quantizing the voice signal; and 

16 b) packetizing the coded voice signal mto a plurality of 

17 digital packets for transmission over the digital voice 

18 network; and 

19 iv) indicating that the voice signal does not include a modulated 

20 data signal when the determined time interval between the 

21 location of the second phase reversal in the signal and the third 

22 phase reversal in the voice signal is not between the 

23 predetermined range of time values. 
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1 15. The method according to claim 12, wherein step b) further includes the steps 

2 of: 

3 a) when the determined time interval is not between the predetermined 

4 range of time values, performing the steps of; 

5 i) locating a third phase reversal in the voice signal, the third 

6 phase signal being the next consecutive phase reversal in the 

7 voice signal after the second phase reversal; 

8 ii) determining the time interval between the location of the second 

9 phase reversal in the signal and the third phase reversal in the 

10 voice signal; 

1 1 iii) when the determined time interval between the location of the 

12 second phase reversal in the signal and the third phase reversal 

13 is between the predetermined range of time values performing 

14 the steps of: 

15 a) encoding the voice signal into a digital data signal 

16 comprising the modulated data signal by linearly 

1 7 quantizing the voice signal; 

18 b) packetizing the coded voice signal into a plurality of 

19 digital packets for transmission over the digital voice 

20 network; and 

21 iv) when the determined time interval between the location of the 

22 second phase reversal in the signal and the third phase reversal 

23 is not between the predetermined range of time values 

24 performing the steps of: 

25 a) compressing the voice signal; and 

26 b) packetizing the compressed voice signal into a plurality 

27 of digital packets for transmission over the digital voice 

28 network. 
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1 16. The method according to claim 12, wherein step a) includes the steps of: 

2 a) selecting a segment of the voice signal; 

3 b) converting the segment of the voice signal from a time domain signal to 

4 a frequency domain signal; 

5 c) comparing the energy of a bin of the frequency domain signal that 

6 corresponds to the frequency of the answer tone with adjacent 

7 frequency bms; and 

8 d) indicating that the answer tone is present in the segment of the voice 

9 signal when the energy of the frequency bin corresponding to the 

10 frequency of the answer tone is greater than the energy of the adjacent 

1 1 frequency bins. 

1 17. The method according to claim 16, wherein step a) includes the steps of: 

2 a) selecting a segment of the voice signal; and 

3 b) down sampling the selected segment of the voice signal. 

1 18. A method of transmitting a digital voice signal over a digital voice network 

2 where the voice signal may include a modulated data signal, the method 

3 comprising the steps of: 

4 a) determining whether an answer tone is present in the voice signal; 

5 b) when an answer tone is present, performing the steps of: 

6 i) locating a first phase reversal in the voice signal; 

7 ii) locating a second phase reversal in the voice signal, the second 

8 phase signal being the next consecutive phase reversal in the 

9 voice signal after the first phase reversal; 

10 iii) determining the time interval between the location of the first 

1 1 phase reversal in the signal and the second phase reversal in the 

12 voice signal; and 

13 iv) when the determined time interval is between a predetermined 

14 range of time values performing the steps of: 
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15 




a) demodulating the voice signal to generate a digital data 


16 




signal; 


17 




b) packetizing the digital data signal into a plurality of 


18 




digital packets for transmission over the digital voice 


19 




network 


20 


c) 


transmitting the digital packets across the network; 


21 


d) 


converting the digital packets into a received digital signal; 


22 


e) 


determining whether the received signal includes a digital data signal; 


23 


f) 


when the received digital signal includes a digital data signal 


24 




modulating the received digital signal into a modulated data signal. 


1 19. 


The method according to claim 1 8, wherein step b) i) includes the steps of: 


2 


a) 


performing an autocorrelation of a segment of the voice signal; 


3 


b) 


searching for the lowest value of the autocorrelation of the segment of 


4 




the voice signal; 


5 


c) 


comparing the determined lowest value of the autocorrelation to a 


6 




predetermined value; and 


7 


d) 


indicating a phase reversal has occurred when the determined lowest 


g 




value is less than the predetermined value. 


1 20. 


The method according to claim 18, wherein step b) further includes the steps 


2 


of: 




3 


a) 


when the determined time interval is not between the predetermined 


4 




range of time values, performing the steps of; 


5 




i) locating a third phase reversal in the voice signal, the third 


6 




phase signal being the next consecutive phase reversal in the 


7 




voice signal after the second phase reversal; 


8 




ii) determining the time interval between the location of the second 


9 




phase reversal in the signal and the third phase reversal in the 


10 




voice signal; 
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1 1 iii) when the determined time interval is between the predetermined 

12 range of time values performing the steps of: 

13 a) demodulating the voice signal to generate a digital data 

14 signal; 

15 b) packetizing the digital data signal into a plurality of 

16 digital packets for transmission over the digital voice 

17 network; and 

18 iv) indicating that the voice signal does not include a modulated 

19 data signal when the determined time interval between the 

20 location of the second phase reversal in the signal and the third 

21 phase reversal in the voice signal is not between a 

22 predetermined range of time values. 

1 21. The method according to claim 18, wherein step b) further includes the steps 

2 of: 

3 a) when the determined time interval is not between the predetermined 

4 range of time values, performing the steps of; 

5 i) locating a third phase reversal in the voice signal, the third 

6 phase signal being the next consecutive phase reversal in the 

7 voice signal after the second phase reversal; 

8 ii) determining the time interval between the location of the second 

9 phase reversal in the signal and the third phase reversal in the 

10 voice signal; 

1 1 iii) when the determined time interval between the location of the 

12 second phase reversal in the signal and the third phase reversal 

13 is between the predetermined range of time values performing 

14 the steps of: 

15 a) demodulating the voice signal to generate a digital data 

16 signal; 
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17 b) packetizing the digital data signal into a plurality of 

18 digital packets for transmission over the digital voice 

19 network; and 

20 iv) when the determined time interval between the location of the 

21 second phase reversal in the signal and the third phase reversal 

22 is not between the predetermined range of time values 

23 performing the steps of: 

24 a) compressing the voice signal; and 

25 b) packetizing the compressed voice signal into a plurality 

26 of digital packets for transmission over the digital voice 

27 network; 

28 and wherein the method further includes the step of when the received digital 

29 signal does not include a digital data signal decompressing the received digital 

30 signal into a voice signal. 

1 22. The method according to claim 1 8, wherein step a) includes the steps of: 

2 a) selecting a segment of the voice signal; 

3 b) converting the segment of the voice signal from a time domain signal to 

4 a frequency domain signal; 

5 c) comparing the energy of a bin of the frequency domain signal that 

6 corresponds to the frequency of the answer tone with adjacent 

7 frequency bins; and 

8 d) indicating that the answer tone is present in the segment of the voice 

9 signal when the energy of the frequency bin corresponding to the 

10 frequency of the answer tone is greater than the energy of the adjacent 

1 1 frequency bins, 

1 23. The method according to claim 22, wherein step a) includes the steps of: 

2 a) selecting a segment of the voice signal; and 

3 b) down sampling the selected segment of the voice signal 
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1 24, A method of transmitting a digital voice signal over a digital voice network 

2 where the voice signal may include a modulated data signal, the method 

3 comprising the steps of: 

4 a) determining whether an answer tone is present in the voice signal; 

5 b) when an answer tone is present, performing the steps of: 

6 i) locating a first phase reversal in the voice signal; 

7 ii) locating a second phase reversal in the voice signal, the second 

8 ' phase signal being the next consecutive phase reversal in the 

9 voice signal after the first phase reversal; 

10 iii) determining the time interval between the location of the first 

1 1 phase reversal in the signal and the second phase reversal in the 

12 voice signal; and 

13 iv) when the determined time interval is between a predetermined 

14 range of time values performing the steps of: 

15 a) encoding the voice signal into a digital data signal 

16 comprising the modulated data signal by linearly 

1 7 quantizing the voice signal ; 

18 b) packetizing the coded voice signal into a plurality of 

19 digital packets for transmission over the digital voice 

20 network 

2 1 c) transmitting the digital packets across the network; 

22 d) converting the digital packets into a received digital signal; 

23 e) determining whether the received signal includes a coded voice signal; 

24 and 

25 f) when the received digital signal includes a encoded data signal 

26 decoding the received digital signal into a modulated voice signal by 

27 dequantizing the received digital signal. 
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1 25. The method according to claim 24, wherein step b) i) includes the steps of: 

2 a) performing an autocorrelation of a segment of the voice signal; 

3 b) searching for the lowest value of the autocorrelation of the segment of 

4 the voice signal; 

5 c) comparing the determined lowest value of the autocorrelation to a 

6 predetermined value; and 

7 d) indicating a phase reversal has occurred when the determined lowest 

8 value is less than the predetermined value. 

1 26. The method according to claim 24, wherein step b) further includes the steps 

2 of: 

3 a) when the determined time interval is not between the predetermined 

4 range of time values, performing the steps of; 

5 i) locating a third phase reversal in the voice signal, the third 

6 phase signal being the next consecutive phase reversal in the 

7 voice signal after the second phase reversal; 

8 ii) determining the time interval between the location of the second 

9 phase reversal in the signal and the third phase reversal in the 

10 voice signal; 

1 1 iii) when the determined time interval is between the predetermined 

12 range of time values performing the steps of: 

13 a) encoding the voice signal into a digital data signal 

14 comprising the modulated data signal by Unearly 

1 5 quantizing the voice signal; 

16 b) packetizing the coded voice signal into a plurality of 

17 digital packets for transmission over the digital voice 

18 network; and 
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19 iv) indicating that the voice signal does not include a modulated 

20 data signal when the determined time interval between the 

21 location of the second phase reversal in the signal and the third 

22 phase reversal in the voice signal is not between the 

23 predetermined range of time values. 

1 27. The method according to claim 24, wherein step b) further includes the steps 

2 of: 

3 a) when the determined time interval is not between the predetermined 

4 range of time values, performing the steps of; 

5 i) locating a third phase reversal in the voice signal, the third 

6 phase signal being the next consecutive phase reversal in the 

7 voice signal after the second phase reversal; 

8 ii) determining the time interval between the location of the second 

9 phase reversal in the signal and the third phase reversal in the 

10 voice signal; 

1 1 iii) when the determined time interval between the location of the 

12 second phase reversal in the signal and the third phase reversal 

13 is between the predetermined range of time values performing 

14 the steps of: 

15 a) encoding the voice signal into a digital data signal 

16 comprising the modulated data signal by linearly 

1 7 quantizing the voice signal ; 

18 b) packetizing the coded voice signal into a plurality of 

19 digital packets for transmission over the digital voice 

20 network; and 

21 iv) when the determined time interval between the location of the 

22 second phase reversal in the signal and the third phase reversal 

23 is not between the predetermined range of time values 

24 performing the steps of: 

25 a) compressing the voice signal; and 
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26 b) packetizing the compressed voice signal into a plurality 

27 of digital packets for transmission over the digital voice 

28 network; 

29 and wherein the method further includes the step of when the received digital 

30 signal does not include a encoded data signal decompressing the received 

3 1 digital signal into a voice signal. 

1 28. The method according to claim 24, wherein step a) includes the steps of: 

2 a) selecting a segment of the voice signal; 

3 b) converting the segment of the voice signal from a time domain signal to 

4 a frequency domain signal; 

5 c) comparing the energy of a bin of the frequency domain signal that 

6 corresponds to the frequency of the answer tone with adjacent 

7 frequency bins; and 

8 d) indicating that the answer tone is present in the segment of the voice 

9 signal when the energy of the frequency bin corresponding to the 

10 frequency of the answer tone is greater than the energy of the adjacent 

1 1 frequency bins. 

1 29. The method according to claim 28, wherein step a) includes the steps of: 

2 a) selecting a segment of the voice signal; and 

3 b) down sampling the selected segment of the voice signal. 

1 30. The method according to claim 12, wherein step b) iv) a) includes the steps of: 

2 a) normalizing the voice signal; and 

3 b) converting the normalized voice signal into a digital floating point 

4 signal. 
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1 3 L The method according to claim 30, wherein step a) includes the steps of: 

2 a) finding the sample of the voice signal having the maximum value; and 

3 b) normalizing the sample of the voice signal having the maximum value 

4 by left shifting the sample until all redundant sign bits are eliminated 

5 where NLS is the number of left shifts; and 

6 c) right shifting all the samples of the voice signal as a function of NLS. 

1 32. The method according to claim 31, wherein step c) includes the step of right 

2 shifting all the samples of the voice signal as a fimction of NLS and the bit 

3 precision of the mantissa of the floating point signal. 

1 33. The method according to claim 31, wherein step c) includes the step of right 

2 shifting all the samples of the voice signal by 16 - NLS - N_prec where 

3 Njprec is the bit precision of the mantissa of the floating point signal. 

1 34. The method according to claim 33, wherein Njprec is set to 4 for 24.8 bits per 

2 second (BPS) data signals, 5 for 30.8 BPS data signals, 6 for 36.8 BPS data 

3 signals, and 7 for 42.8 BPS data signals. 

1 35. The method according to claim 30, further including the step of downsampling 

2 the voice signal from a first rate to a second lower rate. 

1 36. The method according to claim 24, whereui step b) iv) a) includes the steps of: 

2 a) normalizing the voice signal; and 

3 b) converting the normalized voice signal into a digital floating point 

4 signal. 
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1 37. The method according to claim 36, wherein step a) includes the steps of: 

2 a) finding the sample of the voice signal having the maximum value; and 

3 b) normalizing the sample of the voice signal having the maximum value 

4 by left shifting the sample until all redundant sign bits are eliminated 

5 where NLS is the number of left shifts; and 

6 c) right shifting all the samples of the voice signal as a fimction of NLS. 

1 38. The method according to claim 37, wherein step c) includes the step of right 

2 shifting all the samples of the voice signal as a function of NLS and the bit 

3 precision of the mantissa of the floating point signal 

1 39. The method according to claim 37, wherein step c) includes the step of right 

2 shifting all the samples of the voice signal by 16 - NLS - N_prec where 

3 N_prec is the bit precision of the mantissa of the floating point signal. 

1 40. The method according to claim 39, wherein Njrec is set to 4 for 24.8 bits per 

2 second (BPS) data signals, 5 for 30.8 BPS data signals, 6 for 36.8 BPS data 

3 signals, and 7 for 42.8 BPS data signals. 

1 41. The method according to claim 36, fiirther including the step of downsampling 

2 the voice signal from a first rate to a second lower rate, 

1 42. A method of packetizing a digital voice signal for transmission over a digital 

2 voice network where the voice signal may include a modulated data signal, the 

3 method comprising the steps of: 

4 a) determining whether the digital voice signal includes a modulated data 

5 signal; and 

6 b) when the digital voice signal includes a modulated data signal, 

7 performing the steps of: 
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i) encoding the voice signal into a digital data signal comprising 
the modulated data signal by linearly quantizing the voice 
signal; and 

ii) packetizing the coded voice signal into a plurality of digital 
packets for transmission over the digital voice network. 

The method according to claim 42, wherein step b) i) includes the steps of: 

a) normalizing the voice signal; and 

b) converting the normalized voice signal into a digital floating point 
signal. 

The method according to claim 43, wherein step a) includes the steps of: 

a) finding the sample of the voice signal havuig the maximum value; and 

b) normalizing the sample of the voice signal having the maximum value 
by left shifting the sample until all redundant sign bits are eliminated 
where NLS is the number of left shifts; and 

c) right shifting all the samples of the voice signal as a function of NLS. 

The method according to claim 44, wherein step c) includes the step of right 
shifting all the samples of the voice signal as a fimction of NLS and the bit 
precision of the mantissa of the floating point signal. 

The method according to claim 44, wherein step c) includes the step of right 
shifting all the samples of the voice signal by 16 - NLS - N_prec where 
N_prec is the bit precision of the mantissa of the floating point signal. 

The method according to claim 46, wherein N_prec is set to 4 for 24.8 bits per 
second (BPS) data signals, 5 for 30.8 BPS data signals, 6 for 36.8 BPS data 
signals, and 7 for 42.8 BPS data signals. 
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1 48. The method according to claim 43, further including the step of downsampling 

2 the voice signal from a first rate to a second lower rate. 



1 49. 


A method of transmitting a digital voice signal over a digital voice network 


2 


where the voice signal may include a modulated data signal, the method 


3 


comprising the steps of: 


4 


a) 


determining whether the digital voice signal includes a modulated data 


5 




signal; 


6 


b) 


when the digital voice signal includes a modulated data signal, 


7 




performing the steps of: 


8 




i) encoding the voice signal into a digital data signal comprising 


9 




the modulated data signal by linearly quantizing the voice 


10 




signal; 


11 




ii) packetizing the coded voice signal into a plurality of digital 


12 




packets for transmission over the digital voice network 


13 


c) 


transmitting the digital packets across the network; 


14 


d) 


converting the digital packets into a received digital signal; 


15 


e) 


determining whether the received signal includes a coded voice signal; 


16 




and 


17 


f) 


when the received digital signal includes a encoded data signal 


1 o 




decoding the received digital signal into a modulated voice signal by 


19 




dequantizing the received digital signal. 


1 50. 


The method according to claim 49, wherein step b) i) includes the steps of: 


2 


a) 


normalizing the voice signal; and 


3 


b) 


converting the normalized voice signal into a digital floating point 


4 




signal. 
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1 51. The method according to claim 50, wherein step a) includes the steps of: 

2 a) finding the sample of the voice signal having the maximum value; and 

3 b) normalizing the sample of the voice signal having the maximum value 

4 by left shifting the sample imtil all redundant sign bits are eliminated 

5 where NLS is the number of left shifts; and 

6 c) right shifting all the samples of the voice signal as a fimction of NLS. 

1 52, The method according to claim 51, wherein step c) includes the step of right 

2 shifting all the samples of the voice signal as a fiinction of NLS and the bit 

3 precision of the mantissa of the floating point signal. 

1 53. The method according to claim 51, wherein step c) includes the step of right 

2 shifting all the samples of the voice signal by 16 - NLS - Njprec where 

3 Njprec is the bit precision of the mantissa of the floating point signal. 

1 54. The method according to claim 53, wherein N jrec is set to 4 for 24.8 bits per 

2 second (BPS) data signals, 5 for 30.8 BPS data signals, 6 for 36.8 BPS data 

3 signals, and 7 for 42.8 BPS data signals. 

1 55. The method according to claim 50, fiirther including the step of downsampling 

2 the voice signal from a first rate to a second lower rate. 
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ABSTRACT 



A method and apparatus of transmitting a modulated data over a digital voice network 
that includes voice compression. The method and apparatus determine whether a voice 

5 signal to be transmitted over the network includes a modulated data signal and then 

encodes or demodulates the modulated data signal and bypasses voice compression 
when a modulated data signal is detected. The method and apparatus may detect a 
modulated data signal by detecting the presence of an answer tone and then searching 
for a plurality of phase reversals having a particular time distance from each other in 

10 the voice signal. Once the encoded or demodulated signal is transmitted over the 
digital voice network, the method and apparatus of the present invention may detect 
whether the received signal is an encoded data signal or a demodulated data signal. 
Then the coded data signal may be decoded back into a modulated data signal. 
Likewise, the demodulated data signal may be modulated back into a modulated data 

15 signal. In one embodiment, the invention encodes voice signals including modulated 

data signals by linearly quantizing the voice signals. 
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